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MEG1 ENDOSPERM-SPECIFIC PROMOTERS AND GENES 

The present invention relates to controlling the expression of genes 
during the development of the endosperm. It concerns in particular nucleic acid 
5 molecules which enable expression which is both specific to the endosperm 
and early during the development of the endosperm, as well as nucleic acid 
molecules encoding basal endosperm transfer cell layer (BETL) proteins. 

The endosperm, a characteristic formation of Angiosperm seeds, is a 
nutritive tissue for the embryo. The maize endosperm originates with series of 
10 free-nuclear divisions, followed by cellularisation and the subsequent formation 
of a range of functional cellular domains. This tissue is complex in its structure 
and development, in particular in cereals. 

The endosperm is the main storage organ in maize seeds, nourishing 
the embryo while the seed develops, and providing nutrients to the seedling on 
15 germination. Thus, the uptake of assimilates by the growing endosperm is a 
critical process in seed development 

The central area of the endosperm consists of large cells with 
vacuoles, which store the reserves of starch and proteins, whilst the region 
surrounding the embryo is distinguished by rather small cells, occupied for the 
20 major part by cytoplasm . 

The Basal Endosperm Transfer Layer (BETL) area is highly 
specialized to facilitate uptake of solutes during grain development. These 
transfer cells of the basal endosperm have specialised internal structures 
adapted to absorb solutes from the maternal pedicel tissue, and translocate 
25 these products to the developing endosperm and embryo. Usually, BETL genes 
are expressed between 8 to 20 days after pollination (DAP). 

The international patent application WO 99/50427 reports Basal 
Endosperm Transfer cell Layer genes and regulatory sequences thereof. 

However, there is still a need for regulatory sequences specific to the 
30 endosperm. 

The authors of the present invention have now isolated novel promoter 
nucleotide sequences that enable expression of the coding sequences to which 
they can be linked, which is specific to the endosperm in Angiosperm seeds 
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and which intervenes particularly in the early stage of the development of the 
endosperm. These novel promoter nucleotide sequences are useful as an 
alternative to the few promoters described in the literature. 

Such promoter sequences are particularly useful for targeting or 
5 regulating the expression of genes of interest 

Interestingly, the nucleotide sequences having promoter activity 
according to the invention have a maternal parent-of-origin pattern of 
expression so the genes under transcriptional control of these promoters are 
preferentially expressed through the maternal allele during early endosperm 
10 development. 

Advantageously, these nucleotide sequences having promoter activity 
are active as early as 4 DAP, when maternally inherited. 

Surprisingly, the authors found that the proteins encoded by the genes 
related to these promoter sequences are implicated in the mechanism of 
15 pathogen resistance and/or endosperm development. 

The present invention relates to an isolated nucleic acid molecule 
having promoter activity specific to the endosperm that comprises a DNA 
sequence selected from the group consisting of : 
20 (a) a sequence as depicted in any one of SEQ ID No: 1 to 3 

(promoter MEG1-1 (pMEG1-1), promoter MEG1-2 (pMEG1-2), and promoter 
MEG1-3 (pMEG1-3), respectively); 

(b)a fragment of a sequence as defined in (a), wherein said 
sequence has promoter activity specific to the endosperm; 
25 (c)a sequence that has at least 70 % sequence identity with a 

sequence as defined in (a), wherein said sequence has promoter activity 
specific to the endosperm; 

(d) a sequence hybridizing with the complementary strand of a 
sequence as defined in (a) and/or (b) under stringent conditions, wherein said 

30 sequence has promoter activity specific to the endosperm; and 

(e) a sequence that comprises a nucleotide sequence which is 
conserved among at least two of SEQ ID No: 1 to 3. 
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As used herein, the term "conserved" sequence refers to a nucleotide 
stretch that shows 100% identity between at least two sequences of reference. 
Preferably said conserved sequence is found in all of the three sequences SEQ 
ID No: 1 to 3. A sequence alignment of SEQ ID No: 1 to 3 is shown in Figure 
5 14. 

According to an embodiment, said conserved sequence comprises 10 
contiguous nucleotides from SEQ ID No: 1, SEQ ID No: 2 or SEQ ID No: 3, 
preferably 15, 20, 25, or 30 contiguous nucleotides. Preferably, the sequence 
as defined in alternative (e) comprises 

10 5' ATATAGATAGATATAGCAAATTCACCAAATAATATAG 3 1 (SEQ ID No: 57). 
More preferably said sequence consists in SEQ ID No: 57. However, any 
conserved sequence, or a combination of conserved sequences, as shown in 
Figure 14 is within the scope of the invention. 

According to another embodiment, said sequence as defined in 
15 alternative (e) comprises a motif of 9 contiguous nucleotides from SEQ ID No: 
1, SEQ ID No: 2 or SEQ ID No: 3, wherein nucleotides 1-3 and 7-9 of said motif 
are conserved among at least two, preferably three, of SEQ ID No: 1 to 3, and 
nucleotides 4-6 may be variable. 

Particularly, the nucleic acid molecule according to the invention 
20 enables expression (promoter activity) which is both specific to the endosperm, 
more particularly to the BETL, and early during the development of the 
endosperm. 

"Promoter activity specific to the endosperm" means, as used in the 
present invention, that the promoter is predominantly expressed in the 
25 endosperm, and preferably exclusively expressed in the endosperm. 

"Early during the development of the endosperm" means that 
expression of the promoter occurs from the very first days after pollination, 
preferably as early as the fourth day after pollination. 

The various nucleotide sequences of the invention can be of artificial 
30 origin or not. They may be DNA sequences obtained by screening banks of 
sequences by means of probes produced on the basis of SEQ ID No: 1,2,3, 5, 
7, 9, 11, 13, and 15. Such banks can be prepared by conventional techniques 
of molecular biology, known to persons skilled in the art. 
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The nucleotide sequences according to the invention can also be 
prepared by chemical synthesis, or by mixed methods including the chemical or 
enzymatic modification of sequences obtained by screening banks. According 
to an embodiment of the invention, a nucleic acid molecule having such a 

5 promoter activity consists in SEQ ID No: 1 (pMEG1-1), SEQ ID No: 2 (pMEG1- 
2), or SEQ ID No: 3 (pMeg1-3). 

By reference to these promoter sequences, a " fragment n denotes a 
sequence, particularly a DNA sequence, which has a reduced length with 
regard to said sequence of reference. 

10 Such a fragment, according to the invention, has promoter activity 

specific to the endosperm, preferentially specific to the BETL, and contains at 
least 15, 25, 35, 37, 50, 75, 91, 95, 100, 121, 122, 127, 150, 200, 220, 240, 
250, 270, 300, 350, 360 or 365 consecutives nucleotides of the DNA sequence 
of reference. 

15 A preferred fragment contains the region defined by nucleotides 1 to 

121 of SEQ ID No: 1, which corresponds to the sequence shown in SEQ ID No: 
4, or contains the region defined by nucleotides 1 to 127 of SEQ ID No: 1, 
which corresponds to the sequence shown in SEQ ID No: 59. 

Are also preferred fragments the regions defined by nucleotides 26 to 

20 1 20 of SEQ ID No:4 and 29 to 1 1 9 of SEQ ID No:4. 

A fragment according to the invention may also be obtained by fusion 
of at least two consecutive regions of a DNA sequence of reference. An 
example of fusion fragment according to the invention may comprise the region 
defined by nucleotides 1-121 of SEQ ID No: 1, which is required for pMEG1-1 

25 activation by ZmMRPI , and the region defined by nucleotides 244-370 of SEQ 
ID No: 1 , which contains a TATA box. 

It is further possible to make a fusion of the region defined by 
nucleotides 1-121 of SEQ ID No: 1 (Meg1 promoter region) to any minimal 
promoter region (such as a -46 CaMV 35S promoter) in order to produce a 

30 promoter that is liable to be activated by ZmMRPI and thus that is BETL- 
specific, the minimal promoter region being used to replace the region defined 
by nucleotides 244-370 from MEG1-1 promoter sequence (SEQ ID No: 1) in 
the fusion fragment defined above. 
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Such a result is also obtained using fragments 26 to 1 20 or 29 to 1 1 9 of 
SEQ ID No: 4. 

A nucleic acid molecule "hybridizes" to another nucleic acid molecule, 
5 such as a cDNA, genomic DNA, or RNA, when a single stranded form of the 
nucleic acid molecule can anneal to the other nucleic acid molecule under the 
appropriate conditions of temperature and solution ionic strength (see 
Sambrook et al. f 1989). 

Such an hybridizing sequence has promoter activity specific to the 
10 endosperm according to the invention, preferentially specific to the BETL, and 
contains at least 15, 25, 35, 37, 50, 75, 100, 121, 122, 127, 150, 200, 220, 240, 
250, 270, 300, 350, 360 or 365 nucleotides. Preferably, said hybridizing 
sequence may contain no more than 100 or 200 nucleotides. 

The invention also encompasses modifications of the DNA sequences 
15 as depicted in SEQ ID No: 1 to 3 (pMEG1-1, pMEG1-2, pMEG1-3), or of 
sequence motifs thereof by, e. g., nucleotide replacements that do not affect 
the overall structure or binding motif of the nucleic acid molecule having 
promoter activity specific to the endosperm so that it remains capable of driving 
endosperm specific expression of a gene, and more particularly, a BETL 
20 specific expression of a gene. 

"Homologous nucleic acid sequence", or "homologous DNA sequence", 
means any nucleic acid sequence which differs from any of the sequence SEQ 
ID No: 1 to 3 by a substitution, deletion and/or insertion of one or more 
nucleotides at positions such that these homologous nucleic acid sequences 
25 preserve the specificity property of promoters of sequences SEQ ID No: 1 to 3. 

Preferably such a homologous nucleic acid sequence is at least 70 % 
identical to one of the sequences SEQ ID No: 1 to 3 f preferably at least 85% 
identical, more preferably at least 90, 91, 95, 98, 99.9 % identical. Also 
preferably, the degree of identity is defined by comparison with the entire 
30 sequence of reference, SEQ ID No: 1 , SEQ ID No: 2 or SEQ ID No: 3. 

Homology is generally determined using a sequence analysis software 
(for example, the Sequence Analysis Software package of the Genetics 
Computer Group, University of Wisconsin Biotechnology Center, 1710 
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University Avenue, Madison, Wl 53705). Similar nucleotide sequences are 
aligned in order to obtain the maximum degree of homology (i.e. identity). To 
this end, it may be necessary to artificially introduce gaps in the sequence. 
Once the optimum alignment has been achieved, the degree of homology (i.e. 
5 identity) is established by recording all the positions for which the nucleotides of 
the two compared sequences are identical, with respect to the total number of 
positions. 

In a preferential manner such a homologous nucleic acid sequence 
specifically hybridizes to a sequence which is complementary to one of the 
10 sequences SEQ ID No: 1 to 3 under stringent conditions. The parameters 
defining the stringency conditions depend on the temperature at which 50% of 
the paired strands separate (Tm). 

For sequences comprising more than 30 bases, Tm is defined by the 
equation: Tm = 81.5 + 0.41 (%G+C) + 16.6 Log (concentration in cations) - 
15 0.63 (%formamide) - (600/number of bases) (Sambrook et aL, 1 989). 

For sequences shorter than 30 bases, Tm is defined by the equation: 
Tm = 4(G+C) + 2(A+T). 

Under appropriate stringency conditions, in which non-specific 
(aspecific) sequences do not hybridize, the temperature of hybridization is 
20 approximately between 5 and 30°C, preferably between 5 and 10°C below Tm 
and hybridization buffers used are preferably solutions of higher ionic force like 
a solution 6*SSC for example. 

In addition, it is possible to derive essential, regulatory elements from 
the promoters provided herein. Thus, those elements of the promoter sequence 
25 responsible for both its function as a promoter and, more importantly, its 
endosperm specificity, particularly its BETL specificity, can be isolated and 
incorporated into nucleic acid molecules which, although not falling within the 
definitions (a) to (e) above, nonetheless still function in an equivalent manner. 

The present invention also provides a recombinant or isolated nucleic 
30 acid molecule comprising, or consisting of, one or more regulatory elements 
derived from any one of the sequences (a) to (e) and which is capable of 
driving expression specifically in the endosperm, and preferably at an early 
stage of the development of the endosperm. 
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According to an embodiment of the invention, the nucleic acid molecule 
having promoter activity specific to the endosperm preferentially has a maternal 
parent-of-origin pattern of expression. As used herein, the term "maternal 
parent-of-origin pattern of expression" refers to the expression of a gene 
5 maternally imprinted. Imprinting is an epigenetic mechanism by which a 
particular allele is silenced according to its parental origin. This means that 
genes subjected to genomic imprinting have monoallelic expression, instead of 
biallelic expression as usual for autosomal genes, which is based on the sex of 
the transmitting parent 

10 The isolation of promoters that are regulated via imprinting provides an 

additional level of control compared to normal, non-imprinted promoters. 
Accordingly, the level of expression of a gene of interest can advantageously 
be controlled via the direction of the genetic cross between one plant carrying 
an imprinted transgene and the other plant lacking the transgene. 

15 Preferably, the nucleic acid molecule having promoter activity specific 

to the endosperm which has a maternal parent-of-origin pattern of expression 
consists in SEQ ID No: 1. 

SEQ ID No: 62 is also a nucleic acid molecule having promoter activity 
specific to the endosperm which has a maternal parent-of-origin pattern of 

20 expression, and is a part of the invention as any of SEQ ID No: 1 to 3. 

The nucleic acid molecules having promoter activity specific to the 
endosperm according to the invention can be isolated from various plant 
species, notably Angiosperm plants, Monocotyledons or Dicotyledons and are 
preferably nucleic acid molecules isolated from a plant selected from the group 

25 consisting of maize, teosintes, rice, sorghum, wheat, barley, rye, pea, and 
sugar cane. Still preferably, the plant is maize. 

It is possible for the person skilled in the art to isolate with the help of 
promoter sequences of the invention, corresponding genes from other species 
("orthologous" genes). 

30 This can be done by conventional techniques known In the art, for 

example, by using a promoter sequence depicted in any one of SEQ ID No: 1 
to 3 as a hybridization probe or by designing appropriate PCR primers. 
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It is preferable to start with coding DNA sequences or Protein 
sequences via TBLASTN queries. The approach used to isolate rice Meg1 
promoters, for example, is to use the Protein sequence of ZmMeg1-1, 
preferably the most conserved region identified via an alignment of all the Meg 
5 proteins, do a TBLASTN with this sequence against Rice ESTs, then use this 
EST to find the genomic sequence or directly use TBLASTN against the rice 
genome sequence. The same approach is used to isolate wheat Meg1 
promoters. 

It is then possible to isolate the corresponding promoter region by 
10 conventional techniques and test it for its expression pattern by known 
techniques (e.g. reporter gene analysis). 

Another object of the present invention is a nucleotide construction, 
referred to as an expression cassette, comprising a nucleic acid molecule 
having promoter activity specific to the endosperm as defined above, 
15 operatively linked to at least one gene of interest. 

"Operatively linked" refers to functional linkage between a nucleic acid 
molecule having promoter activity according to the invention and the gene of 
interest. 

The gene of interest can be of a heterologous origin. 
20 The gene of interest can be placed in the sense or antisense 

orientation. 

According to an embodiment, the gene of interest may be selected 
from the group consisting of a sequence that encode a peptide or a protein, an 
antisense RNA sequence, a sense RNA sequence, both a sense and antisense 
25 RNA sequence and/or a ribozyme. 

Preferentially, the gene of interest is a sequence that codes for a 
protein or for a peptide. 

The said gene of interest can for example code for a protein involved in 
the development of the embryo and/or of the endosperm, the determination of 
30 seed size and/or quality (e.g. MRP1 or Ferretin (Lobreaux S. et al. 1992)), cell 
growth (proteins regulating cell division including cytokinin or auxin genes, e.g. 
ipt (Zhang et al. 1995), the flow of nutrients or nutrient transfer (transporters 
(Bolchi A. et al. 1999)), proteins involved in fatty acids metabolism. The gene of 
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interest may also encode an enzyme involved in sugar metabolism such as 
invertases (e.g. incW2 (Taliercio EW et al. 1999)), sucrose synthases (e.g. 
Sh1), the saccharose phosphate synthase, saccharose synthase, UDP- 
Glucose pyrophosphorylase, ADP-glucose pyrophosphorylase (Thomas W. 
5 Greene et al. 1998), starch branching enzyme (Ming Gao et al. 1997) or the 
starch synthase (Mary E. Knight et al. 1998). The gene of interest could also 
code for a hexokinase as the one described by Jang et al. (1997) in order to 
improve grain filling. The gene of interest may additionally code for a protein 
that is involved in amino acids transfer, such as a methionine permease or a 

10 lysine permease, or a sulphur transporter etc. It can also code for a toxic 
protein such as Bamase, for a protein activating or inhibiting other genes, such 
as transcriptional regulators including transactivators modified to act as 
dominant activators or repressors of transcription (e.g. fusions to the engrailed 
domain (Poole et al., 1985) or co-repressors for example), or for a protein 

15 improving resistance to pathogens (e.g. BAP2, MRP1 ). 

Preferably, said gene of interest encodes a protein selected from : 

- a protein whose specific expression in the endosperm, and particularly in 
the BETL, makes it possible to increase nutrient uptake and thus seed size 
and/or quality; examples of such a protein include an invertase like Incw2 or 

20 like Ivr1 (EP 0 442 592), a sucrose synthase like Sh1 (WO 02/067662) or any 
transporters of sugar and nitrogen or a MRP1 protein etc; 

- a protein that improves resistance to pathogens; examples of such a 
protein include a BAP Protein (Basal Layer Antifungal Protein) (Sema et al., 
2001), or anti-fungal peptides, or a MRP1 protein or a protein that encodes an 

25 oxalate oxidase (WO 92/15685) or a protein that encodes a chitinase 
(WO 92/01792 or US 5, 446, 138) or a protein that encodes a glucanase 
(WO 93/02197) etc. 
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A protein that "improves resistance to pathogens" or "a protein 
improving resistance to pathogens" means a protein that, when expressed in a 
plant or a part of a plant, confers or improves resistance to pathogens to said 
plant, or part thereof. Said transformed plant has a better resistance to 
5 pathogens than the non-transformed plant (wild-type). 

The said gene of interest can also be associated with other regulating 
elements such as transcription termination sequences (terminators). By way of 
examples of such sequences, it is possible to cite the polyA 35S terminator of 
the cauliflower mosaic virus (CaMV), described in the article of Franck et al. 
10 (1 980) and the NOS terminator corresponding to the region in the non-coding 3' 
region of the nopaline synthase gene of the Ti-plasmid of the Agrobactehum 
tumefaciens nopaline strain (Depicker et al. 1992). 

Preferably, the terminator used is the 3'CaMV. 

According to the invention, the expression cassette, comprising a 

15 nucleic acid molecule having promoter activity specific to the endosperm as 
defined above, operatively linked to at least one gene of interest may further 
comprise one or several selection marker genes for plants, useful for 
transformation and selection. 

In the present invention, the term "selectable marker", "selectable 

20 gene", "selectable marker gene", "selection marker gene", "marker gene" are 
used interchangeably. 

These selectable markers include, but are not limited to, antibiotic 
resistance genes, herbicide resistance genes or visible marker genes. Other 
phenotypic markers are known in the art and may be used in this invention. 

25 A number of selective agents and resistance genes are known in the 

art (See, for example, Hauptmann et al., 1988; Dekeyser et al., 1988; 
Eichholtzetal., 1987 ; and Meijeretal., 1991). 

Notably the selectable marker used can be the bar gene conferring 
resistance to bialaphos (White et al., 1990), the sulfonamide herbicide Asulam 

30 resistance gene, sul (described in WO 98/49316) encoding a type I 
dihydropterate synthase (DHPS), the nptll gene conferring resistance to a 
group of antibiotics including kanamycin, G418, paromomycin and neomycin 
(Bevan et al., 1983), the hph gene conferring resistance to hygromycin (Gritz et 
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al., 1983), the EPSPS gene conferring tolerance to glyphosate (US 5,188,642), 
the HPPD gene conferring resistance to isoxazoles (WO 96/38567), the gene 
encoding for the GUS enzyme, the green fluorescent protein (GFP), expression 
of which, confers a recognisible physical characteristic to transformed cells, the 
5 chloramphenicol transferase gene, expression of which, detoxifies 
chloramphenicol. 

Advantageously, the selectable marker gene is inserted between a 
promoter and a terminator in a second expression cassette. Said second 
expression cassette being integrated in the same vector as the expression 

10 cassette containing the gene of interest under transcriptional control of a 
promoter according to the invention. 

According to this advantageous embodiment, the marker gene is 
preferably controlled by a promoter which allows expression in cells, thus 
allowing selection of cells or tissue containing the marker at any stage of 

15 development of the plant Preferred promoters are the promoter of nopaline 
synthase gene of Agrobacterium, the promoter derived from the gene which 
encodes the 35S subunit of cauliflower mosaic virus (CaMV) coat protein, and 
the rice actin promoter. However, any other suitable second promoter may be 
used. 

20 Any terminator may be used. Preferred terminators are the 3'CaMV 

and Nos terminator as previously described. 

Advantageously, the expression cassette containing the selectable 
marker gene is comprised between two Ds elements (transposons) in order for 
its removal at a later stage by interacting with the Ac transposase. This 
25 elimination system is described in Yoder et al. (1 993). 

For the transformation step, two vectors could be used, the first one 
comprising the expression cassette containing the gene of interest and the 
second one comprising the expression cassette containing the selectable 
marker gene. The same host cell being transformed with these two vectors (co- 
30 transformation). 

The expression cassettes according to the invention may additionally 
contain transit peptide sequences. There are numerous examples in the art of 
transit peptides which may be used to deliver a target protein into a plastid 
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organelle such as the small subunit (SSU) transit peptide of ribulose 
biphosphate carboxylase. 

Other elements like introns and enhancers can also be present in the 
nucleic sequence of interest in order to improve the expression of the gene of 
5 interest 

Among useful introns, the first intron of maize adhlS can be placed 
between the promoter and the coding sequence. This intron when included in a 
gene construct increased the expression of the desired protein in maize cells 
(Callis et al. v 1987). One also can use the 1 st intron of the shrunken 1 gene of 

10 the maize (Maas et al. a 1991), the 1 st intron of the catalase gene of the bean 
catalase (CAT-1) (Ohta et al., 1990), the 2 nd intron of the ST-LS1 gene of 
potato (Vancanneyt et al. 1990), the DSV intron of the yellow dwarf virus of 
tobacco (Morris et al., 1992), the actin-1 intron (act-1) of rice (McElroy et al., 
1990) and intron 1 of triosephosphate isomerase (TPI) (Snowdon et al., 1996). 

15 Preferentially, the intron used in the present invention is the Hsp70 intron or the 
Sh1 intron. 

The expression cassettes may additionally contain 5' leader 
sequences. Such leader sequences can act to enhance translation. Such 5' 
leaders are known in the art and include, but are not limited to, picornavirus 

20 leaders, for example, the EMCV leader (Encephalomyocarditis 5* noncoding 
region) (Elroy-Stein, Fuerest, and Moss B., 1989); potyvirus leaders, for 
example, the TEV leader (Tobacco etch Virus) (Allison et al., 1986) ; the human 
immunoglobulin heavy-chain binding protein leader (BiP) (Macejack and 
Sarnow, 1991) ; the untranslated leader from the coat protein mRNA of alfalfa 

25 mosaic virus (AMV RNA 4) (Jobling and Gehrke, 1987) ; the tobacco mosaic 
virus leader (TMV) (Gallie et al., 1989) ; and the maize chlorotic mottle virus 
leader (MCMV) (Lommel et al., 1991). See also, Della-Cioppa et al. (1987). 
Other methods known to enhance translation can be utilized, for example 
introns, and the like. 

30 Any expression cassette according to the invention, may further 

comprises a gene encoding a MRP1 protein, preferably the ZmMRPI protein. 

The ZmMRPI protein is described in Gomez et al (2002). The nucleic 
acid sequence and amino acid sequence of ZmMRPI are shown in SEQ ID 
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No:51 and 52, respectively. However, any protein homologous to MRP1, and 
particularly to ZmMRPI , that have the same expression pattern can be used. 

Such MRP1 protein can be used according to different ways : 
- MRP1 as anti-fungal protein (resistance to pathogens) operatively linked to 
5 MEG1 promoter sequence SEQ ID No: 1 to 3, or a fragment thereof : 

A first expression cassette comprising an MRP1 protein or a fragment 
thereof, preferably the ZmMRPI protein, operatively linked to a MEG1 
promoter, preferably ZmMEG1-1 promoter, and to a terminator and a second 
expression cassette comprising a marker gene inserted between a promoter 
10 and a terminator ; these two expression cassettes being inserted in the same 
vector for transformation or these two cassettes being inserted in two different 
vectors being co-transformed as previously described. 

- MRP1 as transactivator for MEG1 promoter : 

A first expression cassette comprising a gene of interest, operatively 
15 linked to a MEG1 promoter, preferably ZmMEG1-1 promoter, and to a 
terminator ; a second expression cassette comprising a MRP1 protein, 
preferably the ZmMRPI protein inserted between a promoter and a terminator ; 
a third expression cassette comprising a marker gene inserted between a 
promoter and a terminator ; these three expression cassettes being inserted 
20 into the same vector for transformation or these three expression cassettes 
being inserted into three different vectors being co-transformed as previously 
described. 

This second way of use is a binary system of expression, i.e. two 
plants are crossed, the first one with a gene of interest operatively linked to 

25 MEG1 promoter (preferably ZmMEG1-1) and the other with for example a leaf 
promoter linked to MRP1. The result is that in the hybrid the expression of the 
gene of interest is not only present in the BETL but also in the leaf. 

In preparing the expression cassettes, the various DNA sequences or 
fragments may be manipulated, so as to provide DNA sequences or fragments 

30 in the proper orientation and, as appropriate, in the proper reading frame. 
Towards this end, adapters or linkers may be employed to Join the DNA 
fragments and/or other manipulations may be required to provide convenient 
restriction sites, removal of superfluous DNA, removal of restriction sites, or the 
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like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, 
ligation, PCR, or the like may be employed, where nucleotide insertions, 
deletions or substitutions, for example transitions and transversions, may be 
involved. These techniques are well known by those skilled in the art. 
5 Another object of the invention is any nucleotide vector referred to as 

an expression vector, such as a plasmid, which can be used for transforming 
host cells, characterized in that it contains at least an expression cassette as 
defined above. The construction of expression vectors for the transformation is 
within the capability of one skilled in the art following standard techniques. 

10 The decision as to whether to use a vector, or which vector to use, is 

guided by the method of transformation selected, and by the host cell selected. 

Where a naked nucleic acid introduction method is used, then the vector 
can be the minimal nucleic acid sequences necessary to confer the desired 
phenotype, without the need for additional sequences. 

15 Possible vectors include the Ti plasmid vectors, shuttle vectors designed 

merely to maximally yield high numbers of copies, episomal vectors containing 
minimal sequences necessary for ultimate replication once transformation has 
occured, transposon vectors, including the possibility of RNA forms of the gene 
sequences. The selection of vectors and methods to construct them are 

20 commonly known to persons of ordinary skill in the art and are described in 
general technical references (Mullis, KB (1987), Methods in Enzymology). 

For other transformation methods requiring a vector, selection of an 
appropriate vector is relatively simple, as the constraints are minimal. The 
apparent minimal traits of the vector are that the desired nucleic acid sequence 

25 be introduced in a relatively intact state. Thus, any vector which produces a 
plant carrying the introduced DNA sequence should be sufficient. Also, any 
vector which introduces a substantially intact RNA which can ultimately be 
converted into a stably maintained DNA sequence should be acceptable. 

However, any additional attached vector sequences which confer 

30 resistance to degradation of the nucleic acid fragment to be introduced, which 
assists in the process of genomic integration or provides a means to easily 
select for those cells or plants which are actually, in fact, transformed are 



WO 2005/042745 



PCT/EP2004/052760 



advantageous and greatly decrease the difficulty of selecting useable 
transgenic plants. 

The vector can exist, for example, in the form of a phage, a plasmid or 
a cosmid. The construction of such expression vectors for transformation is well 
5 known in the art and uses standard techniques. Mention may be made of the 
methods described by Sambrook et al. (1989). 

Another object of the invention is a host cell, containing at least an 
expression vector as described above. 

The decision as to whether to use a host cell, or which host cell to use, 
10 is guided by the method of transformation. 

The host cell can be any prokaryotic or eukaryotic cell. Any of a large 
number of available and well-known host cells may be used in the practice of 
this invention. The selection of a particular host is dependent upon a number of 
factors recognized by the art. These include, for example, compatibility with the 
15 chosen expression vector, bio-safety and costs. Useful hosts include bacteria 
such as E. coli sp. or Agrobacterium. A plant host cell, may be also used, 
notably an Angiosperm plant cell, Monocotyledon as Dicotyledon plant cell, 
particularly a cereal or oily plant cell, selected in particular from the group 
consisting of maize, wheat, barley, rice, rape and sunflower, preferentially 
20 maize. 

More particularly, the host cell used in carrying out the invention is 
Agrobacterium tumefaciens, according to the method described in the article of 
An et al., 1986, or Agrobacterium rhizogenes, according to the method 
described in the article of Jouanin et al., 1987. 

25 The invention also concerns a transgenic plant or part of a transgenic 

plant, in particular fruit, seed, grain or pollen, comprising such a cell or 
generated from such a cell. Where the plant contains endogenously a MEG1 
promoter or gene according to the invention, it will be understood that the 
transgenic plant according to the invention comprises an additional 

30 "exogenous" gene MEG1 gene or promoter, for instance integrated by 
transgenese. 

A whole plant can be regenerated from a single transformed plant cell. 
Thus, in a further aspect the present invention provides transgenic plants (or 
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parts of them) including nucleic acid sequences in accordance with the 

invention. The regeneration can proceed by known methods. 

The seeds which grow, by fertilization, from this plant also contain this 

transgene in their genome. 
5 Advantageously, the transgenic plant obtained can produce grains with 

a larger endosperm in comparison with a non-transformed plant, particularly 

grains with starch, oil contents or protein contents which are modified in 

comparison with a non-transformed plant. 

A plant or part of a plant according to the invention could be a plant or 
10 a part of it from various species, notably an Angiosperm, Monocotyledons as 

Dicotyledons, preferably a cereal or oily plant, selected in particular from the 

group consisting of maize, rice, wheat, barley, rape and sunflower, 

preferentially maize. 

As used herein, the term "oily plant" denotes a plant that is capable of 
15 producing oil, and preferably that is cultivated for oil production. 

The hybrid plants obtained by crossing plants according to the invention also 

form part of the invention. 

An other object of the invention is a method of obtaining a plant having 

improved agronomic qualities and/or improved resistance to pathogen, 
20 comprising the steps consisting of : 

(a) transforming at least one plant cell by means of a at least a vector 
as defined previously ; 

(b) cultivating the cell(s) thus transformed so as to generate a plant 
containing in its genome at least an expression cassette according to the 

25 invention, whereby said plant has improved agronomic qualities and/or 

improved resistance to pathogen. 

According to the invention, "improved agronomic qualities" means 

improved agronomic qualities and/or improved nutritional qualities, notably 

yield, food or industrial qualities of a plant or a part thereof. Seed size, yield, 
30 seed number, seed composition are considered as elements conferring 

improved agronomic qualities to a plant as compared to a non-transformed 

plant (wild-type). 
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According to the invention, "improved resistance to pathogens" means 
that the transformed plant have a better resistance to pathogens than the non- 
transformed plant (wild-type). 

The transformation of vegetable cells can be achieved by any one of 

5 the techniques known to one skilled in the art 

It is possible to cite in particular the .methods of direct transfer of genes 
such as direct micro-injection into plant embryoids (Neuhaus et coll. 1997), 
vacuum infiltration (Bechtold at al. 1993) or electroporation (Chupeau et coll., 
1989) or direct precipitation by means of PEG (Schocher et coll., 1986) or the 

10 bombardment by gun of particules covered with the plasmidic DNA of interest 
(Fromm M et al., 1990). 

It is also possible to infect the plant with a bacterial strain, in particular 
Agrobacterium. According to one embodiment of the method of the invention, 
the vegetable cells are transformed by a vector according to the invention, the 

15 said cell host being able to infect the said vegetable cells by allowing the 
integration, in the genome of the latter, of the nucleotide sequences of interest 
initially contained in the above-mentioned vector genome. Advantageously, the 
above-mentioned cell host used is Agrobacterium tumefaciens, in particular 
according to the method described in the article by An et ah, (1986), or 

20 Agrobacterium rhizogene, in particular according to the method described in 
the article by Guerche et al. (1987). 

For example, the transformation of vegetable cells can be achieved by 
the transfer of the T region of the tumour-inducing extra-chromosome circular 
plasmid of Agrobacterium tumefaciens, using a binary system (Watson et al., 

25 1994). To do this, two vectors are constructed. In one of these vectors the T 
region has been eliminated by deletion, with exception of the right and left 
borders, a marker gene being inserted between them to allow selection in the 
plant cells. The other partner of the binary system is an auxiliary plasmid Ti, a 
modified plasmid which no longer has any T region but still contains the 

30 virulence genes vir necessary to the transformation of the vegetable cell. 

According to a preferred mode, it is possible to use the method 
described by Ishida et al. (1996) for the transformation of Monocotyledons. 



WO 2005/042745 PCT/EP2004/052760 

18 

According to another protocol, the transformation is achieved 
according to the method described by Finer et al.(1 992) using the tungsten or 
gold particle gun. 

5 Selection : 

The engineered plant material may be selected or screened for 
transformants (those that have incorporated or integrated the introduced 
nucleotide construction(s)). Such selection and screening methodologies are 
well known to those skilled in the art The selection and screening method is 
10 chosen depending on the marker gene used. 

An isolated transformant may then be regenerated into a plant 

Regeneration : 

Normally, regeneration is involved in obtaining a whole plant from the 
15 transformation process. The term "regeneration" as used herein, means 
growing a whole plant cell, a group of plant cells, a plant part or a plant piece 
(for example, from a protoplast, callus, or tissue part). 

Methods of regenerating whole plants from plant cells are known in the 
art, and the method of obtaining transformed and regenerated plants is not 
20 critical to this invention. 

In general, transformed plant cells are cultured in an appropriate 
medium, which may contain selective agents such as antibiotics, where 
selectable markers are used to facilitate identification, of transformed plant 
cells. Once callus forms, shoot formation can be encouraged by employing 
25 appropriate plant hormones in accordance with known methods and shoots 
transferred to rooting medium for regeneration of plants. The plants may then 
be used to establish repetitive generations, either from seeds or using 
vegetative propagation techniques. 

The invention further relates to the use of at least an expression 
30 cassette as previously defined, for obtaining a transgenic plant exhibiting 
improved agronomic qualities and/or improved resistance to pathogen. 

The agronomic quality of a plant is improved by acting in particular on 
the size of the embryo or of the endosperm and/or its development. 
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This is because an early specific action on the development of the 
tissues of the embryo and of the endosperm can be sought : according to the 
relative size of one or other it would be possible to obtain seeds or fruits with a 
higher starch (large endosperm) and/or oil (large embryo) content, via the use 

5 respectively of stimulator genes (hormone of the cellular cycle for example) or 
inhibitor genes (toxic protein or transcription inhibitor for example). A reduction 
in BETL size would create a miniature type seed which could have a reduced 
endosperm/embryo ratio. The most likely effect of increasing BETL activity/size 
would be to increase endosperm size. This may translate into a bigger embryo 

10 also. 

By way of example, the use of genes coding for hormones (cytokinins, 
auxins) under the control of the promoters described according to the invention, 
would make it possible to modify the process of cellularisation and thus the 
development of the endosperm. 
15 An effect on the accumulation of nutrients in the embryo and 

endosperm can also be sought, by using for example, as genes of interest, 
genes coding for transporters of nutrients (sugar in particular), or genes coding 
for inhibitors of these transporters, leading to differential accumulation of 
nutrients in the endosperm or embryo. 
20 The invention also concerns the use of the transgenic plants obtained 

according to the invention, or parts of these plants, in particular seeds, grains, 
and fruits for preparing derived products, in particular food products. 

The invention relates to seeds obtained from a plant transformed with 
a nucleic acid sequence according to the invention (SEQ ID No: 1-4, 5, 7, 9, 1 1 , 
25 13, 15 and 58). 

The products obtained, whether it be seeds with a higher oil content, 
flours of seeds or grains with a higher starch, protein or oil content, also come 
within the scope of the invention. 

The invention also provides any composition for human or animal food 
30 prepared from the said obtained products. 

The invention further provides an isolated nucleic acid molecule 
encoding a plant basal endosperm transfer cell layer (BETL) protein, which 
comprises a sequence selected from the group consisting of : 
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(a) a nucleotide sequence encoding a protein consisting of an amino acid 
sequence as depicted in SEQ ID No: 6, 8, 10, 53, 12, 14 or 16 
(ZmMEG1-1, ZmMEG1-2, ZmMEG1-3 (first and second ORF), 
ZmMEG1-4, ZmMEG1-5, and ZmMEG1-6, respectively) and variants 

5 thereof, 

(b) a nucleotide sequence as depicted in SEQ ID No: 5, 7 f 9, 11, 13, 15, or 
58 (cDNA of ZmMEG1-1, ZmMEG1-2, ZmMEG1-3, ZmMEG1-4. 
ZmMEG1-5, and ZmMEG1-6, and genomic sequence of ZmMEG1-1, 
respectively); 

10 (c) a nucleotide s equence hybridizing under stringent conditions with the 
complementary strand of a nucleic acid molecule as defined in (a) or (b); 
(d) a sequence encoding a fragment of a protein encoded by a sequence as 
defined in any one of (a) to (c). 
Preferably, the protein encoded by the nucleic acid molecule according 
15 to the invention is specific to the BETL. 

"Specific to the BETL" means, as used in the present invention, that 
the protein is expressed predominantly in the BETL, and more particularly, 
exclusively in the BETL. 

Preferentially, the amino acid sequence encoded by a nucleotide 
20 sequence according consists in SEQ ID No: 6. 

Also preferably, the sequence encoding a fragment of a BETL protein 
according to the invention comprises a nucleic acid sequence encoding the 
PCKDNKCYCCIGGRTH sequence (SEQ ID No:54). This fragment was found 
to be conserved among all maize MEG1 proteins (ZmMEG1-1, ZmMEG1-2, 
25 ZmMEG1-3 (first and second ORF), ZmMEG1-4, ZmMEG1-5, and ZmMEG1-6) 
but also in related proteins from barley and wheat. 

Preferentially, the nucleic acid molecules comprising a nucleotide 
sequence according to the invention consists in any one of SEQ ID No: 5, 7, 9, 
11, 13 or 15, more preferentially consists in SEQ ID No: 5. Additionally said 
30 nucleic acid molecules may comprise the genomic sequence of Zm MEG1-1 
shown in SEQ ID No: 58. 
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As used herein "variants" means that the sequence differs in one or 
more positions in comparison with these sequences SEQ ID No: 6, 8, 10, 53, 
12, 14 or 16 as long as they encode a protein expressed in the BETL. 

Such molecules comprise those which are variants of the BETL protein 
5 according to the invention and differ, for example, by way of amino acid and/or 
nucleotide deletion(s), insertion(s), substitution(s), addition(s) and/or 
recombination(s) or any other modification(s) known in the art either alone or in 
combination from the above described amino acid sequences or their 
underlying nucleotide sequence(s). Methods for introducing such modifications 
10 in the nucleic acid molecules according to the invention are well-known to the 
person skilled in the art. The invention also relates to nucleic acid molecules 
the sequence of which differs from the nucleotide sequence of any of the 
above-described nucleic acid molecules due to the degeneracy of the genetic 
code. 

15 The invention further relates to a protein encoded by said nucleic acid 

molecules. More specifically, the invention provides a plant basal endosperm 
transfer cell layer (BETL) protein or an biologically active fragment thereof 
encoded by a nucleic acid molecule as defined above. Preferably, a protein 
according to the invention may comprise, or consist in, an amino acid sequence 

20 selected from the group consiting of SEQ ID No: 6, 8, 10, 53, 12, 14 and 16. 
Still preferably said protein comprises, or consists in, SEQ ID No:6. 
Additionally, a preferred BETL protein comprises amino acid sequence shown 
in SEQ ID No: 54. 

The proteins encoded by the various variants of the above-described 

25 nucleic acid molecules share specific common characteristics, such as 
biological activity, molecular weight, conformation, etc., as well as physical 
properties, such as electrophoretic mobility, chromatographic behavior, 
sedimentation coefficients, pH optimum, temperature optimum, stability, 
solubility, spectroscopic properties, etc. 

30 Hybridization stringent conditions are defined as described above. 

A " fragment " of a BETL protein or a nucleic acid molecule means a 
sequence, nucleotide sequence or amino acid sequence, which has a reduced 
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length with regard to the nucleotide sequence or amino acid sequence of 
reference. 

A preferred BETL protein fragment according to the invention 
comprises, or consists in, the amino acid sequence PCKDNKCYCCIGGRTH 
5 (SEQ ID No:54). 

All such fragments and variants of the protein of the invention are 
included within the scope of the present invention, as long as the essential 
characteristic biological properties remain unaffected in kind, that is the novel 
nucleic acid molecules of the invention include all nucleotide sequences 
10 encoding proteins or peptides which have at least a part of the primary 
structural conformation for one or more epitopes capable of reacting with 
antibodies to BETL proteins which are encodable by a nucleic acid molecule as 
set forth above. 

The nucleic acid molecule encoding a plant basal endosperm transfer 

15 cell layer (BETL) protein according to the invention can be isolated from various 
plant species, notably Angiosperm plants, Monocotyledons as Dicotyledons 
and are preferably nucleic acid molecules isolated from a plant selected from 
the group consisting of maize, teosintes, wheat, barley, rye, pea, rice, sorghum, 
and sugar cane. Preferably said plant is maize. 

20 Another object of the present invention is a nucleotide construction, 

referred to as an expression cassette, comprising a nucleic acid molecule 
encoding a plant basal endosperm transfer cell layer (BETL) protein according 
to the invention, operatively linked to regulatory elements allowing the 
expression in prokaryotic and/or eukaryotic host cells. Regulatory elements are 

25 notably 5* and 3' regulatory sequences. 

"Operatively linked* refers to functional linkage between the 5' and 3' 
regulatory sequences and the controlled nucleic acid sequence. 
The 5' regulatory sequences are notably promoters. 
Any suitable promoter could be used. It could be for example a tissue- 

30 specific promoter such as a root-specific promoter, a leaf-specific promoter, a 
seed-specific, etc. Numerous tissue-specific promoters are described in the 
literature and any one of them can be used. Preferably, the promoter is a 
pathogen inducible promoter. Such promoters include those from 
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pathogenesis-related protein, which are induced following infection by a 
pathogen, e.g., PR proteins, SAR proteins, beta-1,3 glucanase, chitinase,etc. 

Examples of promoters useful for plant transformation include the 35S 
promoter or the 19S promoter (Kay et a!., 1987), the pCRV promoter (Depigny- 
5 This et aL, 1992), the ubiquitin 1 promoter of maize (Christensen et al., 1996), 
the regulatory sequences of the T-DNA of Agrobacterium tumefaciens, 
including mannopine synthase, nopaline synthase, octopine synthase, the 
promoters regulated during seed development such as the HMWG promoter 
(High Molecular Weight Glutenin) of wheat (Anderson O.D. et al M 1989, 
10 Roberts et al., 1989), the waxy, zein or bronze promoters of maize, a promoter 
that is inducible by pathogens. The promoter may be a promoter as depicted in 
anyofSEQIDNo:1 to 3. 

Preferentially, the promoter used is a pathogen inducible promoter or 
alternatively a constitutive promoter used to control the desired level of disease 
15 control (resistance to pathogen) in the plant 

The 3' regulatory sequence is notably terminators. 

Among the terminators useful for plant transformation within the 
framework of the present invention, the ones which can be used are the polyA 
35S terminator of the cauliflower mosaic virus (CaMV), described in the article 
20 of Franck et al. (1980), the NOS terminator corresponding to the region in the 
non coding 3* region of the nopaline synthase gene of the Ti-plasmid of 
Agrobacterium tumefaciens nopaline strain (Depicker et al. 1992), the histone 
terminator (EP 0 633 317), and the tml terminator. 

Preferentially the terminator is the 3'Nos or 3'CaMV terminator. 
25 Any other element, as previously described (intron, enhancer, transit 

peptide,...) may be comprised in the expression cassette. 

According to the invention, the expression cassette, comprising a 
nucleic acid molecule encoding a plant basal endosperm transfer cell layer 
(BETL) protein as defined above, operatively linked to regulatory elements may 
30 further comprises one or several selection marker gene for plants, useful for 
transformation and selection. 

The selectable marker genes useful have been described previously. 
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Another object of the invention is any nucleotide vector referred to as 
an expression vector, such as a plasmid, which can be used for transforming 
host cells, characterized in that it contains at least an expression cassette 
comprising a nucleic acid molecule encoding a plant basal endosperm transfer 
5 cell layer (BETL) protein, as defined above. 

Examples of useful expression vector have been described above. 
Another object of the invention is a host cell, containing at least an 
expression vector as described above. 

Examples of useful host cells have been described above. 
10 The invention also relates to a transgenic plant, or a part of a 

transgenic plant (leaves, plant cell, plant tissue, grain, fruit, seed,...) 
comprising a cell as described, notably comprising stably integrated into the 
genome a nucleic acid molecule encoding a plant basal endosperm transfer 
cell layer (BETL) protein, operatively linked to regulatory elements allowing 
15 transcription and/or expression of the nucleic acid molecule in plant cells. 

A plant or part of a plant (plant cell, plant tissue, grain, seed, leaves,...) 
according to the invention could be a plant or a part of it from various species, 
notably an Angiosperm, Monocotyledons or Dicotyledons, preferably a cereal 
or oily plant, selected in particular from the group consisting of maize, rice, 
20 wheat, barley, rape, and sunflower, preferentially maize. 

In yet another aspect, the invention also relates to harvestable parts 
and to propagation material of the transgenic plants according to the invention 
which either contain transgenic plant cells expressing a nucleic acid molecule 
according to the invention or which contain cells which show a reduced level of 
25 the described protein. Harvestable parts can be in principle any useful parts of 
a plant, for example, leaves, stems, fruit, seeds, roots etc. Propagation material 
indues, for example, seeds, fruits, cuttings, seedlings, tubers, rootstocks etc. 

The invention further relates to a plant basal endosperm transfer cell 
layer (BETL) protein or an immunologically or biologically active fragment 
30 thereof encodable by a nucleic acid molecule according to the invention. 

The invention also relates to an antibody specifically recognizing a 
BETL protein according to the invention or a fragment, such as SEQ ID No: 54, 
or epitope thereof. 
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These antibodies can be monoclonal antibodies, polyclonal antibodies 
or synthetic antibodies as well as fragments of antibodies, such as Fab t Fv or 
scFv fragments etc. Techniques for producing such antibodies are classical 
methods well known by the one skilled in the art. 
5 Another object of the invention is a method for improving plant 

pathogen resistance, comprising the steps consisting of. 

a) transforming at least a plant cell by means of at least a vector as defined 
previously; 

b) cultivating the cell(s) thus transformed so as to generate a plant 
10 containing in its genome at least an expression cassette according to the 

invention, whereby a plant with improved pathogen resistance is 
obtained. 

Pathogens of the invention include, but are not limited to, viruses or 

15 viroids, bacteria, insects, nematodes, fungi, and the like. For example, 
Fusarium species (notably Fusarium moniliforme, Fusarium graminearum), 
Sclerotinia sclerotlorum, Phoma, Com root worm, Aphis gossypii, etc. For 
maize, this would include especially ear mold fungal pathogens, such as 
Fusarium monoliforme. 

20 Biological activity of the BETL proteins according to the invention can 

be assayed by any method known in the art. Assays to measure for example 
antimicrobial activity and the developmental pathways and defense responses 
that are influenced by the BETL protein are well known in the art. 

Assays that measure antipathogenic activity (resistance to pathogen 

25 activity) are commonly known in the art, as are methods to quantitate disease 
resistance in plants following pathogen infection. Such techniques notably 
include, measuring over time, the average lesion diameter, the pathogen 
biomass, and the overall percentage of decayed plant tissues. 

Antimicrobial and antipathogenic compositions are also included in the 

30 scope of the invention. The compositions of the present invention can be in a 
form suitable for direct application to the plant in need thereof or as a 
concentrate of a primary composition, which requires dilution with a suitable 
quantity of water or other diluent before application. 
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Additionally, the compositions can be used in formulations used for 
their antimicrobial and antipathogenic activities (resistance to pathogens). The 
proteins of the invention can be formulated with an acceptable carrier into a 
pesticidal composition(s) for example, a suspension, a solution, an emulsion, a 
5 dusting powder, a dispersible granule, a wettable powder, and an emulsifiable 
concentrate, an aerosol, an impregnated granule, an adjuvant, a coatable 
paste, and also encapsulations in, for example, polymer substances. 

The methodologies for transformation, selection, regeneration, are 
known to one skilled in the art and have been previously described. 

10 

Another object of the invention is a method for improving the 
agronomic quality of a plant, comprising the steps consisting of : 
a) transforming at least a plant cell by means of at least a vector as defined 
previously; 

15 b) cultivating the cell(s) thus transformed so as to generate a plant 
containing in its genome at least an expression cassette according to the 
invention, whereby a plant with improved agronomic quality is obtained. 

More particularly, said plant exhibits an increased seed size as 
20 improved agronomic quality. 

The overexpression of MEG1 protein into a plant result in a increased 
seed size. When overexpressing MEG1 protein, any promoter could be used in 
the expression cassette. 
25 Preferably the promoter is chosen amongst a BETL specific promoter 

such as any one of the promoters according to the present invention, or the 
BETL2 promoter (WO 99/50427). 

Moreover, the present invention relates to a kit comprising at least one 
30 of the aforementioned nucleic acid molecules, probes, conserved sequences, 
vectors, proteins, compositions or antibodies of the invention. The kit, or its 
components, according to the invention can be used in plant cell and plant 
tissue cultures, for example to detect expression levels of the transgene. The 
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kit and its application are particularly useful to screen for antipathogenic effects 
in fungal cultures. The kit of the invention and its components are expected to 
be very useful in breeding new varieties of, for example, plants which display 
improved properties such as nutritional value or disease resistance (resistance 
5 to pathogens). 

The present invention will be further understood in view of the annexed 
figures and following examples. 

10 Figures 1 and 2 represent Allelic Message Display (AMD) analysis of 

ZmMEG1-1. 

Figure 1 is an autoradiograph of an AMD gel showing a maternal 
pattern of expression forZmMEG1-1 (arrows). 1, F2 selfed; 2, Mo17 selfed; 3, 
F2 x Mo17; 4, Mo17 x F2; 5, A69Y selfed; 6, F2 selfed; 7, F2 x A69Y; 8, A69Y x 
15 F2. 

Figure 2 depicts an allele-specific RT-PCR analysis of ZmMEG1-1 in 
the endosperm. Top, ZmMEG1-1 maternal allele is monoallelic at 4 DAP 
(single band) and biallelic at 12 DAP (double band). Bottom, monoallelic 
expression of Meg1 in interploidy crosses at 4 DAP. The parental inbred lines 

20 used were F2, A69Y (69) and tetraploid W23 (4x). 

Figure 3 represents the amino acid alignment of putative proteins with 
a ZmMEG1-1 domain. Arrows indicate the position of conserved cysteines and 
a circle indicates the position of conserved tyrosines putatively involved in the 
anchoring to the cell wall. 

25 Figures 4 to 7 represent ZmMEG1-1 protein characterization, and 

expression analysis of ZmMEG1-1 and five related cDNAs. 

Figure 4 : RT-PCR analysis was carried out with different RNA 
samples isolated from a range of tissues and amplified with gene specific 
primers (see oligonucleotide table 1). Dark and light box coloration indicate 

30 high and intermediate signal respectively, and a white box indicates absence of 
signal. 

Figure 5 depicts the In situ hybridization analysis of ZmMEG1-1 in 
developing seeds. Left, 4 DAP; Right 12 DAP. 
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Figure 6 shows Immunolocalization of ZmMEG1-1 protein. Left, 12 
DAP BETL region; Right, cell wall localization of ZmMEG1-1 in BETL cells. 

Figure 7 i s a Western analysis of proteins isolated from 10 DAP 
endosperms after subcellular fractionation. 
5 1, cytoplasmic fraction I; 2, cytoplasmic fraction II; 3, cell wall fraction I; 

4, cell wall fraction II. Arrows indicate the presence of different ZmMEG1-1 
proteins. Em, embryo; En, endosperm; ESR, embryo surrounding region; PC, 
pedicel. 

Figure 8 indicates that ZmMEG1-1 is a member of a small gene family 
10 of which parental alleles are differentially methylated in the endosperm 
ZmMEG1-1 methylation analysis in embryo (1-4) and endosperm (5-8) at 6 
DAP. 

1 , W22 self; 2, A69Y self; 3, W22 x A69Y; 4, A69Y x W22; 5, W22 self; 6, A69Y 
self; 7, W22 x A69Y; 8, A69Y x W22. Arrows indicate the presence of 

15 differentially methylated genomic fragments in the endosperm detected with a 
ZmMEG1-1 gene specific probe. 

Figure 9 represents the result of a sequence alignment of ZmMeg1-1 
and ZmMeg1-2 with BAP proteins. 

Figure 10 represents analysis of the ZmMEG1-1 promoter region. Left, 

20 a minimal promoter sequence of ZmMEG1-1 is conserved amongst several 
ZmMEG1~1 promoters from the F2 inbred line. Right, these promoters showed 
high conservation of the four regions also present in other BETL-specific genes 
(named box I to IV). White boxes, putative TATA motifs; boxes I to IV are the 
conserved sequences amongst promoter regions of several known BETL- 

25 specific genes. 

Figure 11 represents the differential expression of pZmMEG1-1:GUS 
in maize endosperm when inherited maternally and paternally. (A-J) sagittal 
sections of kernels stained for the GUS reporter, where (A, C, E, G and I) are 
seeds from an A188::pZmMEG1-1:GUS x A188 cross; (B, D, F, H and J) are 

30 seeds from the reciprocal A188 x A188::pZmMEG1-1:GUS cross. Note the 
presence of GUS signal in (A) and (C) at early stages of endosperm 
development but absent when paternally transmitted (B and D). (A and B) 4 
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DAP; (C and D) 6 DAP; (E and F) 10 DAP; (G and H) 15 DAP; (I and J) 20 
DAP. 

Figure 12 is an analysis of the minimal promoter region of ZmMEG1-1. 
Left, Schematic representation of the promoter-deletion constructs employed in 

5 a co-bombardement experiment with a p35S:MRP1 construct Right, results 
obtained from the co-bombardment assay. Plus signs indicate the presence of 
GUS staining (transactivation); Minus signs denote absence of GUS staining 
(lack of transactivation). 

Figure 13 is a diagrammatic representation of a crossing strategy 

10 enabling to obtain crops with improved value. 

Figure 14 depicts a pairwise sequence alignment of MEG1 (SEQ ID 
No: 1 = promoter MEG 1-1), MEG2 (SEQ ID No: 2 = promoter MEG 1-2), and 
MEG3 (SEQ ID No: 3 = promoter MEG 1-3) performed with CLUSTAL W 
(1.74). The nucleotides shown in bold correspond to SEQ ID No: 57. 

15 Underlined nucleotides correspond to conserved regions between BETL 
specific genes (megl, betl's and DDIs), and shaded nucleotides correspond to 
conserved region between the above genes and ESRs. Boxed nucleotides 
correspond to TATA boxes. 

20 EXAMPLES : 

The invention will now be described by the way of the following 
examples, which should not be construed as in any way limiting the scope of 
the invention. 

25 Several oligonucleotides and primers could be used to amplify or 

isolate the sequences or sequence fragments according to the present 
invention. The oligonucleotides and primers described are not limiting. The 
man skilled in the art knows how to design an oligonucleotide, or a specific 
oligonucleotide, based on a sequence. 

30 

EXAMPLE 1 : Plant material and growth conditions 
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Maize (Zea mays L.) diploid inbred lines W22, F2, B73 and Mo17, and 
tetraploid W23 (COOP Maize Stock Centre) were glasshouse grown under the 
following regime: 16 h day length (supplemented with metal halide lamps at 
250mmols, when required) at 22-28°C during the day, and at 16-20°C at night. 
5 Humidity levels were set at approximately 40-50 % daytime and 60-70% at 
night All lines were selfed and reciprocally crossed, and kernels were 
harvested at 6-24 days after pollination (DAP). Embryos and endosperms were 
isolated and pooled, then frozen in liquid nitrogen and stored at -80°C. 

10 

EXAMPLE 2 : Identification and characterization of the ZmMegl gene 
family 

A genomic screen designed to identity endosperm transcripts showing 
15 parent-of-origin patterns of expression was carried out by allelic message 
display (AMD) (Guti§rrez- Marcos et al., 2003). 

Allelic message display (AMD) : 

Total endosperm RNA was extracted from selfed and reciprocally- 
20 crossed inbred lines and used for AMD-PCR according to the protocol of 
Hagiwara et al. (1997). 

Reverse transcribed RNA was used as source material for PCR 
(HIEROGLYPH kit; Genomix-Beckman), labeling with [alpha 33P]-dATP. A 
combination of 240 primers was used to carry out PCR reactions and products 
25 were analyzed in a semi-automatic Genomix LR DNA sequencing system. 
Following exposure to film (Biomax MR, Kodak), candidate bands were excised 
from the gel and amplified using PCR. 

This resulted in the presence of several fragments showing monoallelic 
maternal expression, one of which was termed Zea mays maternally- 
30 expressed-genel (ZmMEGI) (Figure 1). 

Identification of ZmMEGI cDNAs and expression analysis. 
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2.1) Isolation of Meal cDNA and related sequences : 

Full length cDNAs were obtained after screening a 7 DAP maize 
endosperm cDNA library (Hueros et aL, 1999b). Among the 500,000 plaques 
screened, five cDNA clones were identified and sequenced. This identified the 
5 full-length cDNA, ZmMEG1-1 , plus three other similar, but not identical, cDNAs 
which we termed ZmMEG1-2, -3, and -4. After screening the maize genome 
database, a further two ESTs showing partial similarity to the C-terminus of the 
translated MEG 1-1 protein sequence were identified and denominated 
ZmMEG1-5, and-6. 

10 

2.2) Mapping : 

The ZmMEGI gene family was mapped to the short arm of maize 
chromosome 7, between markers csu13 and bngl1200, by restriction fragment 
length polymorphism (RFLP) analysis using a population of immortalized F2 
15 lines. 

2.3) Southern analysis : 

The diversity of the ZmMEGI family was revealed in the F2 inbred line 
by Southern analysis and BAC screening, this analysis indicated that the 
20 ZmMEGI gene family comprises from two to four gene clusters with a few 
copies present in every cluster. 

For Southern blotting, DNA was digested with the restriction enzyme 
Xhol, the hybridization solution was 5xSSPE / 5x Denhardfs/O.SyoSDS, and the 
blots were probed with full length Meg 1-1 coding region (SEQ ID No: 5). 
25 Southern analysis under high stringency conditions (hybridization 65°C 

and wash for 10 minutes at 65°C with 0.1xSSPE/0.5%SDS) revealed that the 
ZmMEGI gene family comprises between 5-8 copies present in most maize 
inbred lines, teosintes and other grasses (rice, sorghum, sugar cane). 

Southern analysis under low stringency conditions (hybridization 50°C 
30 and wash for 10 minutes at 65°C with 2xSSPE/0.5%SDS) also revealed bands 
in wheat and barley. 



2.4) Expression Pattern : 
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The expression pattern of ZmMEG 1-1 and the other ZmMEG f-like 
cDNAs (ZmMEG1-2, ZmMEG1-3, ZmMEG1-4, ZmMEG1-5, ZmMEG1-6) was 
assessed by reverse transcription-polymerase chain reaction (RT-PCR) using 
gene-specific oligonucleotide pairs shown in Table 1 below. 

Table 1 . List of oligonucleotides used for RT-PCR analysis. 



Gene 
Name 


Forward Oligonucleotide 


Reverse Oligonucleotide 


PCR 
size 
(bp) 


MEG1-1 


TGCTGCTCATGCGCATGGGGC 
TG. SEQ ID No:23 


TTAGAAGCAKGCATGWCTACA 
CTSAGCC, SEQ ID No:24 


165 


MEG1-2 


ATGCACATGGGAAGGGTCATG 
TC, SEQ ID No:25 


TTAGAAGCAKGCATGWCTACA 
CTSAGCC, SEQ ID No:26 


193 


MEG1-3 


GCATAGCAGGAGTGGAGGGC, 
SEQ ID No:27 


GAAGCAGGCATGACTACACTC, 
SEQ ID No:28 


292 


MEG1-4 


TGGCCAATGTCGCCTCCGAG, 
SEQ ID No:29 


TTAGAAGCAKGCATGWCTACA 
CTSAGCC, SEQ ID No:30 


320 


MEG1-5 


ATGGCTGGCTATGGTGTTGAT 
G, SEQ ID No:31 


GTGCAGTTTGCAGGTAAGCCC, 
SEQ ID No:32 


150 


MEG1-6 


TGTACGCCTGACTTGGCTAGC 
AACC, SEQ ID No:33 


TTAGAAGCAKGCATGWCTACA 
CTSAGCC, SEQ ID No:34 


181 


GST 


GCAACGTACCGTACCTTTCCG 
A, SEQ ID No:35 


ACGCTGCATTCAATTACCGGG 
AAG, SEQ ID No:36 




GUS 


GGGCCAACAGTTCCTGATTAA 
CC, SEQ ID No: 55 


CCCCGTTG ACTGCCTCTTCG , 
SEQ ID No: 56 





The expression pattern is shown in Figure 4. 
10 Specific ZmMeg1-1 alleles showed 99.9% identity in several inbred 

lines (A188, B73, Mo17, F2, W22 and tetraploid W23) and were detected using 
cleaved and amplified polymorphic sequences (Neff et al., 1 998). 

To distinguish between ZmMeg1-1 alleles, RT-PCR analysis was 
performed employing MEG1 (5-TGCTGCTCATGCGCATGGGGCTG-3\ SEQ 
15 ID No:17) and MEGIHpal (5-TTGTATATAAAAACAGTG ATGTTAA-3' , SEQ ID 
No: 18) primers to produced a DNA fragment that after tfpal digestion 
generated fragments of 177-bp for F2 and 1 98-bp for A69Y or W23. 

The gluthatione synthasel gene was used as a control for a biallelic 
parent-of-origin expression. 
20 ZmMEG 1-1 transcript was only present in endosperm samples 

between 4 and 20 DAP indicating that its expression is endosperm-specific. 
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ZmMEG1-2 and ZmMEG1-4 also showed a similar pattern of expression, 
whereas ZmMEG1-3 was expressed in endosperm as well as in anther and 
pollen samples. By contrast, ZmMEG1-5 and ZmMEG1-6 were found to be 
expressed in most tissues tested. 

5 

EXAMPLE 3 : Parent-of-origin pattern of expression of ZmMEG1-1 

3.1) ZmMea1-1 has a maternal parent-of-origin pattern of expression 
10 To confirm the maternal parent-of-origin expression pattern of 

ZmMEG1-1 t a polymorphism was detected in the 3'UTR of the gene, between 

F2 and A69Y, and W23 and other inbred lines, which allowed us to design 

oligonucleotides for allele-specific RT-PCR. 

The results revealed exclusive expression of the ZmMEG1-1 maternal 
15 allele at early stages of endosperm development (4 DAP) (Figure 2). 

Surprisingly, it has been found that at later stages of endosperm development 

(12 DAP) ZmMEG1-1 expression became biallelic (Figure 2). 

To investigate the effects of altering the maternal to paternal genomic 

ratio in the endosperm on ZmMEG1-1 expression, reciprocal crosses between 
20 diploid and tetraploid inbred lines were performed. Allele-specific RT-PCR 

showed that the ZmMEG1-1 expression pattern remained unaltered (Figure 2). 

3.2) ZmMea1-1 alleles are differentially methylated in the endosperm 
Since ZmMEG1-1 is subjected to a parent-of-origin pattern of 

25 expression in the endosperm it can be predicted that changes in methylation 
might occur between the different parental alleles. 

The inventors have tested this hypothesis by examining the 
methylation status of ZmMEG1-1 parental alleles using methylation-sensitive 
restriction enzyme digestion and Southern analysis with a gene specific probe 
30 on genomic DNA obtained from embryo and endosperm samples. 

Differences in the methylation status of parental ZmMEG1-1 alleles 
were apparent in the endosperm alone; ZmMEG1-1 maternal alleles appeared 
demethylated whereas paternal alleles were methylated (Figure 8). 
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These results strongly suggest a correlation between methylation 
status of a given ZmMEG1-1 allele in the endosperm and a parent-of-origin 
pattern of expression. 

5 

EXAMPLE 4 : ZmMeg1-1 protein characterisation 

4.1 ) ZmMea1-1 encodes a basal endosperm transfer cell wall protein 
The full length ZmMEG1-1 cDNA contained an open reading frame of 

10 88 amino acids (Figure 3), encoding a predicted 9794 Da protein with a pi of 
7.5. The ZmMEG1-1 polypeptide contains a hydrophobic N-terminal region with 
characteristics of a 27 amino acid signal peptide (von Heijne, 1986). Further 
analysis identified a putative cleavage site between His-26 and Glu-27, thus 
producing a polypeptide containing 61 amino acids, with a predicted molecular 

15 mass of 6730 Da and pi 6.2. 

An interesting feature of the ZmMEGI polypeptide is the presence of a 
highly conserved cysteine-rich domain in the C-terminal portion, also present in 
the ZmMEGI A\kes (ZmMEGI -2, -3, -4, -5, and -6) (Figure 3). This conserved 
region contains six cysteines and two tyrosines, and resembles other cysteine 

20 domains found in several plant proteins with hypothetical cell-wall localization 
(de Oliveira et al., 1990; Domingo et al., 1999). 

In situ hybridization of ZmMEG1-1 was performed on kernel sections 
at various stages of endosperm development with a gene specific probe 
kernels according to a published method (Jackson, 1991), with minor 

25 modifications DB (Costa et al., 2003). (Figure 5). 

No signal was detected with the sense probe, however, the antisense 
probe only gave signal in the transfer cells from 4 DAP and showed maximum 
expression at around 6-10 DAP (Figure 5), clearly indicating that ZmMEG1-1 is 
exclusively expressed in the basal transfer region of the endosperm. 

30 

4.2) ZmMEGI -1 protein localization : 

To determine the localization of ZmMEGI -1 protein in maize 
endosperm, a polyclonal antiserum was raised by using a synthetic peptide 
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designed to the N-terminus of the putatively processed ZmMEG1-1 
polypeptide. 

Immunolocalization 

5 Polyclonal antiseaim was raised in rabbit against a synthetic peptide 

(NAPAEEGILREKRAQC, SEQ ID No: 19) and affinity purified with an 
immobilized peptide using a Sulpholink coupling gel system (Pierce, UK). 
Maize kernels were fixed in 4 % paraformaldehyde in 0.1 M sodium phosphate 
buffer pH 7.2, for 12-24h depending on the tissue volume. Samples were 

10 dehydrated in an ethanol series and wax embedded. Sections were de- 
paraffinized and blocked in 1% BSA in PBS (10mM sodium phosphate, 150mM 
NaCI pH 7.4) for 30 min at room temperature, and incubated overnight with 
anti- MEG1 antiserum or pre-immune serum (both diluted 1:500). The 
immunoreactions was detected using alkaline phosphatase-coupled secondary 

15 antibody (Sigma; diluted 1:1000) and NBT/BCIP as substrate. 

Immunolocalization was performed using the purified antiserum, which 
detected ZmMEG1-1 protein adjacent to the cell wall ingrowths of basal 
endosperm transfer cells (Figure 6). 

20 

To obtain biochemical evidence of protein localization to the cell wall, 
protein extracts from 10 DAP endosperms were fractionated as described by 
Sema et al. (2001) and analyzed by immunoblotting (Figure 7). 

25 Subcellular fractionation of proteins 

Fractionation of subcellular components of endosperms (F2 inbred) 
was performed as described by Serna et al. (2001) . Proteins were fractionated 
by SDS-PAGE electrophoresis according to (Laemmli, 1970), and 
electroblotted onto a polyvinylidine difluoride (PVDF) membrane. Proteins were 

30 detected using an enhanced chemiluminescence method (ECL, Amersham, 
UK). 

Several proteins with predicted molecular masses that ranged from 6 
to 50 kDa were present in the cytoplasmic fractions, two of which corresponded 
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to ca. 9-10 kDa and ca. 6-7 kDa. Interestingly, the ca. 6-7 kDa proteins gave 
the greatest signal intensity in the cell wall preparation, thus providing evidence 
for cell wall localization of the putative cleaved ZmMEG1-1 polypeptide (Figure 
7). Three other proteins with molecular masses between 2.5 to 3.5 kDa were 
5 observed in the cell wall fraction. 

4.3lZmMEG1-1 role: 

The BETL and BAP families comprise the largest basal transfer layer - 
specific protein families found to date. These small proteins are secreted into 

10 the pedicel region or are localized to the cell wall and are strongly suggested to 
be implicated in antifungal attack and pathogen defence (Hueros et al., 1995; 
Serna et al., 2001). 

As known from the art (Hueros et al. 2002), the BETL-specific genes 
isolated to date share some structural features that strongly suggest that they 

15 play a role in defense against pathogen entry into the developing seed. They 
encode small, Cys-rich secreted peptides that accumulate only transiently 
during endosperm development. 

Amino acid sequence analysis of ZmMeg1-1 revealed a cysteine-rich 
motif in the C-terminal region comprising six cysteine residues and two tyrosine 

20 residues: 0(X)4-CYCC-(X)8-Y-(X)4- C-(X)3-C (SEQ ID No: 20), which was 
conserved in the other ZmMEG1-like proteins (ZmMEG1-2, ZmMEG1-3, 
ZmMEG1-4, ZmMEG1-5 and ZmMEG1-6) (figure 3) and showed high similarity 
to many cysteine-rich domains identified in other plant proteins (Hueros et al., 
1995; Domingo et al., 1999; Hueros et al., 1999b; Sema et al., 2001) (Schopfer 

25 et al., 1999). Cysteine-rich domains are believed to be required for the 
maintenance of particular protein conformations, which result in the exposure 
of interdigitated side chain residues (Berg and Shi, 1996), such as tyrosine 
residues found in ZmMEGI. Importantly, it has been reported that exposure of 
tyrosine residues facilitate the binding of synthetic-proteins to the cell wall 

30 (reviewed in Cassab, 1998). 

Since ZmMEG1-1 protein localization is proximal to the maternal-filial 
Interface, it is strongly expected that it may have a defensin-like function, as 
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reported for BETL1,3 (Hueros et aL, 1995; Hueros et aL, 1999a) and ZmES 
proteins (Cordts et al., 2001). 

Furthermore the modification of MEG1 protein level modifies seed size 
as the decrease of MEG1 protein level induces a reduced seed size (see 
5 below). 

An alignement of MEG1-1 and MEG1-2 amino acid sequence with that 
of BAPs proteins has been performed and the result is shown in Figure 9. 

Two Meg1 constructs (expression cassettes) to determine the effect of 
Meg1 gene overexpression in maize have been made : 

10 

Maize transformation : 

Embryogenic type II calli were transformed with the construct and 
regenerated as described by Bonello et aL, 2000. 

15 The following constructs have been obtained by classical molecular 

biology methods. 

Construct A : p35S - HSP70intron - Meg1 genomic - 3 'CaMV 
34 maize transformants have been generated, some plants exhibit 
20 improved resistance to the pathogens. 

Construct B : p35S - HSP70intron - Meg1 cDNA- 3 'CaMV 
36 maize transformants have been generated, some plants exhibit 
improved resistance to the pathogens. 

25 

Construct C : p35S Meg1 - RNAi - 3 ' OCS 

A Meg1 construct (expression cassette) to determine the effect of 
Meg1 downregulation has been made. 

30 Vector pGUSART, was obtained by subcloning an EcoRV fragment (789 

to 1820) of the GUS gene into pART7 (Gleave A.P (1992) Plant Mol. Biol. 
20:1203-1207). 
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A fragment of the Meg 1-1 cDNA sequence (151 to 380) was amplified by 
PCR using two oligonucleotides: 

megl RNAi.FOR (Clal-EcoRI) S'-ATCGATGAATTC- 

5 GCTCAAGGGTTTCTTCCATG -3' (SEQ ID No: 63) 

megl RNAi.REV (BamHI-Xhol) 5'-GGATCCTCGAG- 

CCTCTAGTATCGGTCTGAC -3' (SEQ ID No: 64) 

10 The PCR product was subcloned into pGEM-T easy and sequenced. 

After digestion with BamHI and Clal, and EcoRI and Xhol the fragments were 
subcloned into pGUSART to generate a 3-5' and 5-3* construct flanking the 
deleted portion of the GUS gene capable of produce a duplex formation. 

15 24 maize transformants have been generated, some plants exhibit 

reduced resistance to the pathogens. 

In particular, 4 out of the 24 Megl RNAi plants exhibited a segregation 
in seed size on the cob. Kernels that inherited the RNAi transgene were found 
to be smaller than wild type segregant kernels (figure not shown). Thus 

20 inhibition of Megt expression reduces seed size. 

EXAMPLE 5 : ZmMEG1-1 promoter isolation and sequence 
characterization 

25 

5.11 Meal Genomic DNA isolation 

A Meg1.1 probe was used to screen an F2 BAC library (O'Sullivan et 
al„ 2001), resulting in six genomic ZmMeg1-1 fragments that were subcloned 
into pBluescript KSII (Stratagene,UK) and sequenced. Sequence analysis 
30 revealed the existence of at least four different copies of ZmMEG1-1 in the F2 
inbred line, all of which showed subtle differences in their promoter region, yet 
highly conserved exon and intron regions. A minimal region of the promoter 
(ca. 370 bp) was found to be conserved across other promoters belonging to 
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known basal endosperm transfer cell-specific genes (Hueros et al., 1999b; 
Sevilla-Lecoq et ai., 2003) (Figure 10). 

5.2) Reporter construct preparation : 

5 A minimal promoter region of ZmMEG1-1 was isolated by PCR using 

two specific oligonucleotides (5'-ACACCTCAAATAGATATGGATATAG-3\ SEQ 
ID No : 60, and 5*-GTCGCAAGAAATGTTAAGGAACTCC-3\ SEQ ID No: 61) 
and subcloned into pGEM-T easy vector (Promega, Madison, USA). 

After digestion with EcoRI and C/al, the 371 bp fragment was fused to 

10 the p-glucoronidase (GUS) encoding sequence and the nopaline synthase 
(NOS) terminator of the pSU4K1 vector (Jones et al., 1992) to generate the 
transcriptional fusion vector denominated pZmMEG1-1-GUS. 

5.3) Maize transformation and expression analysis : 

15 To test whether the minimal promoter region of ZmMEG1-1 was able 

to confer transfer cell-specific gene expression we stably transformed maize 
plants with the pZmMEG1-1 -GUS transcriptional fusion construct. 

Maize transformation : 

20 Embryogenic type II calli were transformed with the construct and 

regenerated as described by Bonello et al., 2000. 

Five independent transgenic lines were generated (0757-1 C, 0757-2E, 
0757-2F, 0757-2D, 0757-2L) and selected on the basis of high levels of p- 
glucuronidase expression, as detected by histochemical staining of kernels. 

25 Plants were genotyped for (3-glucuronidase (GUS) transcriptional fusions via 
PCR using GUS-specific oligonucleotides (SEQ ID No: 55 and 56). 
Histochemical analysis of transgeneic lines was performed according to 
(Jefferson et al., 1987; Costa et al., 2003). Briefly, fixed kernels were 
dehydrated to 70 % ethanol before analysis and digitally imaged. 

30 GUS staining was only ever observed in endosperm tissue, and not in 

any other plant tissues tested. 
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These lines were subsequently backcrossed with pollen from an A188 
standard inbred for four consecutive generations to confirm the stable pattern 
of expression in the endosperm. 

Two of the independent transgenic lines (0757-2E and 0757-2D) were 
5 randomly selected and reciprocally outcrossed with wild-type A188 plants to 
test for changes in patterns of transgene expression based on their parental 
mode of inheritance. 

For each line, around 200-280 kernels at different developmental 
stages were isolated from four plants and histochemically stained for GUS. For 
10 each time point analyzed, around 50-70 kernels were isolated from four plants, 
and histochemically stained for GUS, as mentioned above. 

Interestingly, we found differential temporal GUS expression when 
transmitted either maternally or paternally. 

When pZmMEG1-1-GUS plants were either selfed or outcrossed as 
15 females, GUS staining was present in transfer cells from 4 DAP, when the 
endosperm becomes fully cellular, and remaining until 20 DAP. . A maximum 
level of GUS staining was attained at 10-12 DAP. 

However, when pZmMEG1-1-GUS plants were outcrossed as males 
(i.e. transmitted paternally through pollen), histochemical GUS activity was only 
20 ever detected after 10 DAP, when it was at its highest, and the declined in 
expression by 20 DAP. The apparent delay in expression of the paternally 
inherited transgene was confirmed at the transcriptional level by Northern 
analysis (data not shown). 

Taken together, the data demonstrate that a minimal region of MEG1 -1 
25 promoter is able to confer different temporal transfer cell-specific expression of 
uidA reporter gene depending on parental origin. 

So, expression of pZmMEG1:GUS in BETL cells is dependent on 
parental inheritance. 

30 

EXAMPLE 6 : Transactivation of the ZmMEG1-1 promoter by ZmMRPI 
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It has been recently found that a transfer cell-specific transcription 
factor, ZmMRPI, is able to transactivate expression of several BETL promoters 
(Gomez et aL, 2002). 

To investigate whether ZmMRPI was also capable of transactivating 
5 ZmMEG1-1, etiolated maize leaves were cobombarded with the pZmMEG1-1- 
GUS construct and a 35S enhanced version of the ZmMRPI protein (p35S- 
MRP1 translation^ fusion). 

The data showed that pZmMEG1-GUS was significantly transactivated 
by ZmMRPI (Figure 11), indicating that ZmMRPI is sufficient for activation of 
10 ZmMEG1-1 in leaf tissues and is probably sufficient to activate ZmMEG1-1 
expression in endosperm transfer cells. 

The same results are obtained with the MEG1 promoter according to 
SEQ ID No: 62. 

15 

EXAMPLE 7 : Promoter deletion analysis 

To define the region essential for transactivation by ZmMRPI, we 
generated a deletion series of the conserved ZmMEG1-1 promoter region and 
20 fused these to the uidA reporter gene. 

The serial deletion analysis was performed by PCR amplification of 
different regions of the ZmMEG1-1 promoter, using multiple oligonucleotide 
pairs (see table 2 below) and then subcloning into pGEM-T easy vector 
(Promega, Madison, USA). 



• \ 



WO 2005/042745 PCT/EP2004/052760 



42 



Table 2. List of oligonucleotides used in deletion analysis. 



Construct 
name 


Forward Oligonucleotide 


Reverse Oligonucleotide 


pMEG1- 
GUS 


ACACCTCAAATAGATATGGATATA 
, SEQ ID No:37 


GTTATCTATTCTATTCTATCATATCTA 
TC, SEQ ID No:38 


pMEG1- 
31/30GUS 


GATATAGATATATAGAAGAGATAT 
AGATGG, SEQ ID No:39 


GTTATCTATTCTATTCTATCATATCTA 
TC, SEQ ID No:40 


pMEG1- 
32/30GUS 


AGATAGATATGATAGAATAGATA 
GATAAC, SEQ ID No:41 


GTTATCTATTCTATTCTATCATATCTA 
TC, SEQ ID No:42 


pMEG1- 
33/30GUS 


ATTTTGTCTAAAGAGACTAAATCA 
CTGC. SEQ ID No:43 


GTTATCTATTCTATTCTATCATATCTA 
TC. SEQ ID No:44 


pMEG1- 
30/31 GUS 


ACACCTCAAATAGATATGGATATA 
, SEQ ID No:45 


CCAATTCACTGGGTTATCTATTCTAT 
TCTATCATATCT, SEQ ID No:46 


pMEG1- 
30/32GUS 


ACACCTCAAATAGATATGGATATA 
, SEQ ID No:47 


CCAATTCACTGGCCATCTATATCTTC 
TATATATCTATA, SEQ ID No:48 


pMEG1- 
30733GUS 


ACACCTCAAATAGATATGGATATA 
, SEQ ID No:49 


CCAATTCACTGGCCCCTTGTTATATC 
TATATCTATACC, SEQ ID No:50 



Constructs were digested with EcoRI and BsfXi and fragments were 
5 subcloned into the pZmMEG1-1-GUS construct pre-digested with EcoRI and 
BsfXi. By following this approach the putative TATA boxes identified in the 
ZmMEG1-1 promoter remained intact. 

The ZmMRP-1 coding region was isolated by PCR using 
oligonucleotides MRP. FOR (5'GGATCCATGAATCCCAACTTCAACAGTG3\ 
10 SEQ ID No:21) and MRP.REV 

(5 , GAATTCTTATCGGTTATATATCTGGCTCTCC3 , f SEQ ID No:22). PCR 
fragments were subcloned in pGEM-T easy (Promega, Madison, USA), 
digested with BamHU EcoR\ and the 327bp fragment was subcloned into the 
plasmid described by Pang et al. (1996) to generate p35S:HSP70::MRP1. 
15 Plasmid DNA was isolated by the QIAprep midi kit (Qiagen, Germany) and 
coated onto tungsten (M10) particles according to (Klein et al., 1992). 

For transient transformation, Hi-ll maize seeds were surface sterilized 
and germinated in the dark. Etiolated leaves (2cm wide) were sectioned into 1- 
2cm long pieces and co-bombarded with ZmMEG1-1 promoter deletion and 
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ZrnMRPI constructs using a BioRad Biolistic PDS-1000/He device. Gold 
particles (0.6nm, from BioRad, UK) were coated with the DNA plasmid mixture 
including 2.5g of p35S:MRP1 derived plasmid and 2.5g of ZmMEG1-1 
promoter deletion. Tissues were positioned 6 cm from the microcarrier stopping 
5 screen itself located 5 cm below the 6.2 Mpa rupture disc. 

After bombardment, samples were incubated for 24h at 26°C in the 
dark on solid medium (Murashige and Skoog, 1962)containing 100 mg/l myo- 
inositol, 2 g/l glutamine, 30g/l sucrose, and Murashige and Skoog vitamins 
(Sigma, UK). 

10 A minimum of three independent experiments were conducted for each 

promoter deletion made. 

Histochemical detection of GUS expression was performed by staining 

the leaf discs according to (Jefferson, 1989) with modifications. Leaf pieces 

were stained in a medium containing 0.5 mg/ml Xglucuronide (Clonetech, Palo 
15 Alto, CA), 0.5mM phosphate buffer, pH 7.0, 0.1% triton X-100 and 20% (v/v) 

methanol. 

It has been found that removal of the distalmost portion of the 
promoter (-370 to -240 region) disabled transactivation of pZmMEGI GUS by 
ZrnMRPI. 

20 To determine whether this minimal -370 to -250 promoter region 

(Figure 12) was sufficient to allow transactivation by ZrnMRPI, either alone or 
in combination with other regions within the conserved ZmMEG1-1 promoter, 
deletions were performed in the reverse orientation. 

From this analysis, the -370 to -250 minimal promoter region in 

25 combination with the putative TATA box region (-127 to 1) was deemed 
sufficient to confer transactivation of the ZmMEG1-1 promoter by ZrnMRPI . 

Taken together, these results conclusively demonstrate that ZrnMRPI 
activates ZmMEG1-1 expression by transactivation of the -370 to -250 
conserved domain also present in promoters of many BETL-specific genes. 

30 The same results are obtained with the -345 to -251 region (95 

nucleotides in length) that corresponds to nucleotides +26 to +120 of SEQ ID 
No:4 and with the fragment from nucleotide +29 to +119 of SEQ ID No:4 (91 
nucleotides in length). 
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EXAMPLE 8 : Expression in the BETL area of genes of interest 

5 SEQ ID No: 1 and SEQ ID No: 62 have 90 % identity and 6,7% of gaps 

after alignment on the basis of SEQ ID No: 1. This differences correspond to 
deletions, insertions, and base replacements that do not affect the functionality 
of the sequence so that same results are obtained with SEQ ID No: 1 and SEQ 
ID No: 62. 

10 

A MEG1 promoter according to the invention is useful to express from 
an early stage of BETL development proteins or RNA that result in desirable 
changes in seed development or quality. Such proteins or RNA may for 
example increase the size of the BETL layer and/or increase its capacity to 

15 import nutrients into the developing seed. Alternatively such proteins or RNA 
may improve resistance to pathogens and thus better protect the developing 
seed from transfer of pathogens from maternal tissues resulting in seeds (and 
progeny plants) of higher quality and vigour. 

As an example of the first utility Meg1 promoter (ZmMEG1-1) is linked 

20 to the cell wall invertase incW2 and as an example of the second utility Meg1 
promoter (ZmMEG1-1) is linked to the anti-fungal protein BAP2. 

8.11 Overexoression of lncW2 in the BETL 

lncW2 is naturally expressed in the BETL. Its importance in the 
25 determination of seed size was demonstrated via the analysis of lncW2 

mutants which exhibit a miniature kernel phenotype (Cheng et al (1996) Plant 

CellJun;8(6):971~983)). 

Meg1-1 promoter is fused to the coding region of lncW2 and cloned 

into a binary vector for agrobacterial mediated transformation of maize. 
30 Analysis of the transformed plants indicates that some plants overexpress 

lncW2 and this expression is earlier than normal in seed development These 

plants possess seeds that are larger than segregant seeds that lack the 

transgene which have normal levels of lncW2. 
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The same results are obtained with a construct comprising the MEG1 
promoter as defined in SEQ ID No: 62. 

8.2) Overexpression of BAP2 in the BETL 

5 BAP2 is naturally expressed in the BETL. Its importance in resistance 

of the developing seed to fungal attack was strongly suggested in studies 
where the isolated protein demonstrated significant anti-fungal activity in vitro 
(Semaetal(2001)). 

Meg1-1 promoter is fused to the coding region of BAP2 and cloned 

10 into a binary vector for agrobacterial mediated transformation of maize. 
Analysis of the transformed plants indicates that some plants overexpress 
BAP2 and this expression is earlier than normal in seed development. These 
plants possess seeds that have improved resistance to pathogens compared to 
segregant seeds that lack the transgene which have normal levels of BAP2. 

15 The same results are obtained with a construct comprising the MEG1 

promoter as defined in SEQ ID No: 62. 

EXAMPLE 9 : Use of Meg1 promoter to improve the crop value 

20 

The Meg1-1 promoter is active earlier in the BETL if inherited from the 
female than from the male. Early overexpression of proteins in the BETL layer 
leading to a desirable kernel phenotype in some cases can disrupt the normal 
development of the kernel such that the germination of the kernel is reduced. 

25 For example early overexpression of the transactivator MRP1 (Gomez et al 
(2002)) in the BETL leads to seeds with a hypertrophied BETL layer. Since the 
BETL layer is important for nutrient flow into the developing endosperm, this 
hypertropy leads to the production of a larger endosperm and a larger seed. 
However in some genetic backgrounds or crops this hypertrophy gives a larger 

30 endosperm at the expense of embryo development. Thus although the resulting 
seed has a higher endosperm to embryo ratio and thus a higher value, the 
MPRP1 transgene is not utilisable due to poor seed germination. 
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Overexpression from an imprinted promoter, particularly such as 
ZmMEG1-1 , overcomes this problem. 

Maize plants were transformed with a construct comprising in the sens 
orientation the MEG1 promoter (as defined in SEQ ID No: 62), the ZmMRPI 
5 cDNA, and the AtSac66 terminator (Jenkins et al.1 999). 

Homozygous pMEG1:MRP1 plants are used as pollen donors in 
crosses to wild-type plants (poor germination of these plants is acceptable at 
the multiplication stage). All the hybrid progeny of this cross will have normal 
kernels since the late paternal expression of MRP1 has little effect on BETL 
10 size. The hybrid progeny of this cross will have 50% of kernels with increased 
endosperm size, enough to significantly improve the crop value (Figure 13). 

The same results are obtained with a construct comprising the MEG1 
promoter as defined in SEQ ID No: 1 (pMEG1-1). 
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